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DNA polymerases need to be engineered to achieve optimal performance for biotecli- 
nological applications, which often require high fidelity replication when using modified 
nucleotides and when replicating difficult DNA sequences. These tasks are achieved for 
the bacteriophage T4 DNA polymerase by replacing leucine with methionine in the highly 
conserved Motif A sequence (L412M). The costs are minimal. Although base substitution 
errors increase moderately, accuracy is maintained for templates with mono- and dinu- 
cleotide repeats while replication efficiency is enhanced. The L412M substitution increases 
intrinsic processivity and addition of phage T4 clamp and single-stranded DNA binding 
proteins further enhance the ability of the phageT4 L412I\/1-DNA polymerase to replicate all 
types of difficult DNA sequences. Increased pyrophosphorolysis is a drawback of increased 
processivity, but pyrophosphorolysis is curbed by adding an inorganic pyrophosphatase or 
divalent metal cations, l\/ln^+ or Ca^+. In the absence of pyrophosphorolysis inhibitors, 
theT4 L412M-DNA polymerase catalyzed sequence-dependent pyrophosphorolysis under 
DNA sequencing conditions. The sequence specificity of the pyrophosphorolysis reac- 
tion provides insights into how the T4 DNA polymerase switches between nucleotide 
incorporation, pyrophosphorolysis and proofreading pathways. The L-to-M substitution was 
also tested in the yeast DNA polymerases delta and alpha. Because the mutant DNA 
polymerases displayed similar characteristics, we propose that amino acid substitutions 
in Motif A have the potential to increase processivity and to enhance performance in 
biotechnological applications. An underlying theme in this chapter is the use of genetic 
methods to identify mutant DNA polymerases with potential for use in current and future 
biotechnological applications. 
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INTRODUCTION 

The remarkable advances made in molecular biology in the last 
40 years are dependent to a large extent on DNA polymerase- 
dependent methods. Researchers who pioneered the use of DNA 
polymerases for DNA sequencing, PGR and site-directed muta- 
genesis were awarded Nobel prizes for their ground-breaking 
achievements, but DNA polymerases play key roles in numer- 
ous additional indispensable methods including DNA labeling, 
cloning, whole genome amplification, and diagnostic tech- 
niques. Researchers, however, recognized from the earliest days 
that some DNA polymerase activities are counter-productive 
for in vitro applications. Nuclease activities, for example, can 
degrade primers and the newly synthesized DNA, yet some exonu- 
cleolytic proofreading activity is needed for high fidelity DNA 
replication. Other DNA polymerase activities need to be modi- 
fied, for example the ability to utilize non-standard nucleotides 
and to replicate "difficult" DNA templates with simple repeats 
or sequences that are excessively rich in A-l-T or G-l-C base 
pairs. DNA polymerases from several organisms have been subject 
to extensive engineering in order to remove or curb unwanted 



activities and to modify or enhance others that are required 
for optimal performance in vitro (see several chapters in this 
volume and reviews by Hamilton etal., 2001; Reha-Krantz, 
2008). 

One of the most chaOenging tasks has been to engineer 
DNA polymerases to replicate difficult DNA sequences. DNA 
polymerases stall and often dissociate at difficult template 
sequences and these sequences are frequently sites for repli- 
cation errors. For example (Streisinger etal, 1966; Streisinger 
and Owen, 1985) observed that simple repeat sequences in 
phage T4 genes are hotspots in vivo for frameshift muta- 
tions, which are typically insertions or deletions within mono- 
and di-nucleotide repeat sequences. The length of the repeat 
sequence affects the likelihood of frameshift mutations as 
hotspot mutation sites have longer repeat tracks than colder, 
less mutable sites. Streisinger etal. (1966) and Streisinger and 
Owen (1985) that frameshift mutations are created by tran- 
sient separation of the primer and template strands followed 
by strand misalignment during reannealing to create an inter- 
mediate with an unpaired repeat in the template or primer 
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Strand, but with a correctly paired primer-template terminus 
that can be extended by a DNA polymerase. If the misaligned 
DNA strands are not corrected before the next round of chro- 
mosome replication, the repeat sequence will be expanded 
(if the unpaired repeat is in the primer strand) or con- 
tracted (if the unpaired repeat is in the template strand). As 
expected, these "slippery" DNA templates are also difficult to 
replicate in vitro as they slow or even prevent further repli- 
cation and are sites for insertions and deletions (Rao, 1994; 
Fidalgo da Silva and Reha-Krantz, 2000; Clarke etal, 2001; 
Fazekas etal, 2010). Kunkel etal. (1994) demonstrated that 
-M insertions in homopolymeric runs dramatically increase in 
reactions with the phage T7 DNA polymerase in the absence 
of thioredoxin, the T7 DNA polymerase processivity factor. 
Thus, decreased DNA polymerase processivity is correlated 
with increased strand misalignment mutagenesis, presumably 
by increasing DNA polymerase dissociation which provides an 
opportunity for the free primer-end to spontaneously denature 
and then to re-anneal in a misaligned configuration. This pro- 
posal is supported by other studies, for example, Fazekas etal. 
(2010). 

We observed that the leucine to methionine (L412M) substi- 
tution in the conserved Motif A (Table 1) of the bacteriophage 
T4 DNA polymerase produced a mutant DNA polymerase that 
has improved ability to replicate difficult DNA sequences under 
typical DNA sequencing conditions. Furthermore, the L412M- 
DNA polymerase has increased ability to incorporate and extend 
modified nucleotides. Both activities are facilitated by increased 
intrinsic processivity (Reha-Krantz and Nonay, 1994), as explained 
below. 

The L412M substitution in the phage T4 DNA polymerase 
was identified by a genetic selection strategy as a second-site 
amino acid substitution that suppressed the excessive proof- 
reading observed for several mutant DNA polymerases (Stocki 
etal, 1995; Li etal, 2010). As expected, the L412M substitu- 
tion reduces proofreading activity, but base substitution muta- 
tions are increased moderately 10- to 40-fold, while frameshift 
mutations by only threefold at most (Table 2). Thus, high 
fidelity DNA replication observed for the wild type T4 DNA 
polymerase, about one error in 10''-10^ nucleotides incorpo- 
rated (Kunkel etal., 1984), is essentially retained; hence, the T4 
L412M-DNA polymerase is an excellent candidate polymerase 
for single-molecule sequencing and other applications where 
error-free replication is required. Note that amino acid substi- 
tutions in Motif A of DNA polymerases from several organisms 
have been observed to cause minor to major changes in replica- 
tion fidelity, both increasing and decreasing accuracy (Table 2; 
Reha-Krantz and Nonay, 1994; Patel and Loeb, 2000; Patel 
etal, 2001; Fidalgo da Silva etal, 2002; Niimi etal, 2004; Li 
etal, 2005; Venkatesan etal, 2006, 2007; Pursell etal., 2007; 
Sakamoto etal., 2007; Nick McElhinny etal., 2008; Zhong etal., 
2008). 

We review the development of the phage T4 L412M-DNA 
polymerase as an important DNA sequencing tool and its use 
in other applications. There are useful lessons to be learned 
because replication of difficult DNA templates remains salient 
with current DNA sequencing and amplification technologies. 



Because amino acid substitutions in Motif A of the bacterio- 
phage T4 DNA polymerase have profound effects on intrinsic 
processivity (Figure 1), we propose that amino acid substi- 
tutions in Motif A of other DNA polymerases, especially the 
L-to-M substitution in family B DNA polymerases, have the 
potential to increase processivity and enhance DNA polymerase 
performance in biotechnological applications. A strategy to 
"evolve" mutant DNA polymerases with increased processivity 
and other desirable activities for in vitro applications (Bourn 
etal., 2011) is compared to our genetic evolution strategies. 
We also present new data about pyrophosphorolysis activity, a 
byproduct of increased intrinsic processivity. While the scope of 
this chapter is limited to methods used to enhance the intrin- 
sic processivity of the phage T4 DNA polymerase, information 
about the naturally highly processive ^29 DNA polymerase is 
presented by Blanco etal. (1983) and Eid etal. (2009). The 
association of thioredoxin with the T7 DNA polymerase and 
the development of Sequenase^'^ is presented by Zhu (2014). 
Increased processivity produced by fusing DNA-binding domains 
to DNA polymerases is discussed by Arezi etal. (2014) in this 
volume. 

MATERIALS AND METHODS 

DNA POLYMERASES AND DNA POLYMERASE ACCESSORY PROTEINS 

Expression, purification, and characterization of wild type and 
mutant T4 DNA polymerases were described previously (Lin et al., 
1987; Spicer etal, 1988; Reha-Krantz etal., 1993). The T4 gp45 
clamp and clamp loading proteins (gp 44/gp62) and the T4 single- 
stranded DNA (ssDNA) binding protein (gp32) were expressed 
and purified as described (Shamoo etal., 1986; Rush etal., 1989), 
but with modifications for large scale production. 

DNA POLYMERASE INTRINSIC PROCESSIVITY 

The intrinsic processivity of wild type and mutant phage T4 
DNA polymerases was determined in the presence of a heparin 
trap as described by Reha-Krantz and Nonay (1994). Briefly, 
20 (il reactions contained 25 mM HEPES, pH 7.5, 60 mM 
NaOAc, 1 mM dithiothreitol, 0.5 mM EDTA, 80 \iM. dNTPs, 
0.2 mg/ml bovine serum albumin, 7.5 nM primed single- 
stranded circular M13 DNA (expressed as 3' -primer ends), 
and 150 nM DNA polymerase. Reactions were pre-incubated 

5 min at 30°C and started by adding a solution of Mg^+ [Cf, 

6 mM Mg(OAc)2] and heparin (Cf, 0.1 mg/ml). Reactions 
were incubated 15 s at 30° C and stopped by the addition of 
2 (il 0.2 M EDTA. Reaction products were separated on DNA 
sequencing gels (7% acrylamide, 8 M urea) and the ^^P-labeled 
products were visualized by exposure to Kodak X-Omat AR 
film. 

PYROPHOSPHOROLYSIS ASSAY CONDITIONS AND INHIBITION BY 
Mn2+ AND Ca2+ IONS 

Pyrophosphorolysis activity was measured for the exonuclease- 
deficient D112A/E114A/L412M-DNA polymerase in 12 [i\ reac- 
tions containing partially digested duplex DNA that was 3'- 
labeled with [^-P]dCMP (50 pmol of labeled 3'-ends/reaction), 
10 nM DNA polymerase, 67 mM Tris-HCl (pH 7.5), 16.7 mM 
(NH4)2S04, 0.5 mM DTT, 167 (i,g BSA/ml, 6.7 mM MgCl2, and 
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Table 1 | Motif A in Phage T4 DNA polymerase and yeast DNA polymerases. 



DNA pol 


Motif A s6C|U6nc6 


Phpnntvnp^ 


flRntlTA \A/ilrl t\/nci 


^Uo U IL oLT roll 


\A/ilrl t\/no 


T4 L412M 


M 


Weak mutator^, strongly PAA-sensitive^ 


T4 1417V 


V 


Antimutator^, sensitive to reduced dGTP^ 


T4 L412M/I417V 


MV 


Antimutator^, PAA-sensitive 


T4 L412I 


1 


\A/oalt' anti m iitatnr^ cQnciti\/o tn tqHi ir'oH HfnXP^ 
vVtJclIx allLIIIIULa LUI , o tJ 1 lol LI VC LvJ 1 tJUUV-ztJU U vj 1 r 


T4S411T/L412M 


TM 


Weak antimutator^, sensitive to reduced dGTP^ 


Sc DNA pol S 


608 Dfn SLYPSIM 


Wildtype 


Sc L612M S 


M 


Mutator'', PAA-sensitive'' 


be LblzM/V/bolVIo ' 


M 


Wildtype fidelity, PAA-resistar^t 


Sc DNA pol a 


864DfnSLYPSII 


Wildtype 


Sc L868M-a 


M 


Mutator", PAA-sensitive'' 


Sc L868F-a 


F 


Mutator", PAA-sensitive'' 


Sc L868W-a 


W 


Weak mutator", PAA-resistant"' 



^Data from Table 2 and Reha-Krantz and Nonay (1994). 

'°Data from Li etal. (2005); mutator phenotype observed in the absence of mismatch repair 
^Data from Niimietal. (2004). 
^ Unpublished observations, LR-K. 
^V758M is near IVIotIf C. 
^ Li (2004). 



1 mM PPi. Reactions with Mn^+ or Ca^+ ions also contained 
15 mM Na citrate. The PPi concentration varied from 0.5 to 
6 mM and Mn^+ or Ca^"*^ ion concentrations varied from 0.25 
to 10 mM. Reactions were stopped by the addition of 2 [il 0.2 M 



EDTA. The product of pyrophosphorolysis, 



2p]dCTP was 



separated by thin layer chromatography on polyethyleneimine 
impregnated cellulose (PEI) plates. Samples (2 (il) of the reac- 
tions were applied 1 cm from the bottom of the plate. The plates 
were developed in 50% ethanol up to 5 cm, dried and devel- 
oped in 0.15 M KH2PO4 - 15% ethanol solvent until the solvent 
reached the top of the plate. Plates were dried and either the 
UV-absorbing spot was cut out and radioactivity was determined 
using a scintillation counter or the dried TLC plate was exposed 
to a Phosphorlmager screen (Molecular Dynamics). These con- 
ditions were also studied under DNA sequencing conditions (see 
below). 

Pyrophosphorolysis was also determined in assays using the 
fluorescence of the base analog, 2-aminopurine (2AP). The use 
of 2AP to study DNA polymerase function is described with 
detailed instruction by Reha-Krantz (2009). Briefly, 2AP fluores- 
cence in DNA is quenched by base-stacking interactions, but the 
2AP nucleotide is highly fluorescent. Thus, for primer-templates 
labeled at the 3'-end of a primer with 2AP, pyrophosphorolysis 
is detected as an increase in fluorescence intensity due to pro- 
duction of free 2AP deoxynucleoside triphosphate (d2APTP). 
Because 3'-exonuclease activity will also release the terminal 
2AP nucleotide, but as the deoxynucleoside monophosphate 
(d2APMP), DNA polymerases were engineered to be exonucle- 
ase deficient with the D112A/E114A amino acid substitutions 



which prevent Mg^+ binding in the exonuclease active site (Reha- 
Krantz and Nonay, 1993; Elisseeva etal., 1999). Stopped-flow 
experiments were performed with the Applied Photophysics SX. 1 8 
MV spectrofluorometer. Samples were excited at 310 nm; a 
335 nm cut-off filter was used. Temperature in the sample- 
handling unit was maintained at 20.0 ± 0.5°C. Reactions were 
initiated in the stopped flow by mixing equal volumes of a 
solution of 1400 nM exonuclease-deficient T4 DNA polymerase, 
400 nM 2AP-DNA substrate, 5 mM PPi, 25 mM HEPES (pH 
7.5), 2 mM DTT, 50 mM NaCl and 1 mM EDTA with a sec- 
ond solution of 16 mM MgCl2, 25 mM HEPES (pH 7.5), and 
50 mM NaCl. After mixing, the final concentrations of reac- 
tion components were 700 nM exonuclease-deficient T4 DNA 
polymerase, 200 nM 2AP-DNA, 2.5 mM PPi, 8 mM MgCh, 
1 mM DTT, 25 mM HEPES (pH 7.5), 50 mM NaCl, and 
0.5 mM EDTA. Between five and six determinations were per- 
formed for each reaction and mean values were calculated. 
The experimental traces were fit to either single or double 
exponential equations. The agreement of the curve fits was 
judged by analysis of the randomness of the distribution of 
residuals for curves generated by single or double exponen- 
tial equations. Two DNA substrates were used that differed 
in the A-l-T- or G-|-C-richness in the primer-terminal region. 
A+T DNA: primer, 5'GCACGTCATCGGTAATP; template, 
3'CGTGCAGTAGCCATTATGGATCGATGGTTT G+C DNA: 
primer, 5'GCACGTCATTAACGGTP; template, 3'CGTGCAG 
TAATTGCCATGGATCGATGGTTT. The A-^T and G+C primer 
terminal DNA sequences are indicated in bold font. P indi- 
cates 2AP 
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Table 2 | DNA replication fidelity by phageT4 wild type and mutant polymerases with amino acid substitutions in motif A. 

DNA polymerases Forward mutation Types of ac mutations (percentage) Base substitution reversion Frameshift reversion 

frequency^ frequency*" frequency"^ 



Wild type'' 



1/10^ 



GC^AT 

AT^GC 

GC^TA 

GC^CG 

AT^TA 

AT^CG 



Total frameshift 
Total other'' 



(18%) 

(12%) 

(1%) 

(3%) 

(5%) 

(1%) 



0.7/10'^ 



1/10^ 



Total base substitution 40% 



28% 
32% 



L412M'^ 



35/1 0^ 



GC^AT (37%) 

AT^GC (2%) 

GC^TA (13%) 

GC^CG (2%) 

AT^TA (8%) 

AT^CG (9%) 
Total base substitution 90% 

Total frameshift 3% 

Total other 7% 



9/1 0^5 



3/10'' 



1417V 

L412M/I417V 
L412I 

S411T/L412M 



nd' 
nd 
nd 
nd 



nd 
nd 
nd 
nd 



0.005/10S'9 
0.03/10'''9 

0.07/10^'S' 



nd 
nd 
nd 
nd 



^Mutations that confer acriflavin resistance in tlie ac gene (Wang and Ripiey, 1998). 
^ Reversion of rll 199oc to rll; mostly /\T— ► GC mutations (Reha-Krantz, 1995). 

Reversion of rlll31 by + 1 fs. 
^Data from several students, especially Alia Daoud and Fatima Kamal. 
^Total "other" are mostly deletions between direct repeats, 
^nd: not determined. 

9 Data from Reha-Krantz and Nonay (1994). 



DNA SEQUENCING WITH THE T4 EXONUCLEASE-DEFICIENT, 
L412M-DNA POLYMERASE HOLOENZYME COMPLEX: FIDELITY™ 

FIDELITY'T'^ (ONCOR, Gaithersburg, MD, USA) was a manual 
DNA sequencing kit that was marketed for general sequencing 
purposes but also for use to sequence difficult DNA templates. 
There are four steps. In the first step, primer (1 pmol), labeled 
at the 5'-end in reactions using T4 polynucleotide kinase and 
[y^^p]atp, was annealed in a 10 |xl reaction to 1 |xg ssDNA M13 
DNA in annealing buffer [25 mM Tris-HCl (pH 8.5), 20 mM 
MgCh, and 50 mM NaCl]. Reactions were held at 65° C for 
2 min and then cooled slowly over 30 min to 35° C. The reac- 
tions were then pulse-centrifuged and chilled briefly on ice. If 
the primer was not labeled, a second step was performed to 
label the primer DNA internally using [a^^S] or [ci(^^P]dATP at 
1500 Ci/mmol, 10 mCi/ml. Internal labeling reactions contained 
the 10 [i\ annealed DNA produced in the first step; 3 (i-l T4 reac- 
tion buffer [0.4 M Tris-HCl (pH 8.5), 40 mM MgCh, 40 mM DTT, 



0.4 mg/ml acetylated BSA; 4 mM ATP; 1.4 \xM each dTTR dGTP, 
dCTP]; exonuclease- deficient L412M-DNA polymerase (10 nM); 
and water to produce a final total volume of 18 [il. The reac- 
tion was incubated at 40 to 42° C for 15 min and then placed 
on ice. Primer elongation with chain terminators was carried 
out in the third step. A mixture (6 |xl) of T4 DNA polymerase 
accessory proteins (gp32, 2.7 mg/ml; gp44/62, 0.2 mg/ml; gp45, 
0.7 mg/ml) in 20 mM Tris-HCl (pH 8.0), 100 mM NaCl, 1 mM 
EDTA, 1 mM DTT, and 62% glycerol was added to the 18 (il of 
labeled DNA and mixed. 5.5 [xl of the elongation mix was added 
to 2 (il of the four separate termination reaction tubes. All ter- 
mination mixes contained 300 [iM dATR dTTP, dGTP, and dCTP 
that were supplemented with 40 [iM 3'NH2ddATP (A mix), or 
80 [iM 3'NH2ddTTP (T mix), or 40 [iM 3'NH2ddGTP (G mix), 
or 80 \lM 3'NH2ddCTP (C mix). The reactions were incubated at 
40 to 42°C for 5 min. In the last (4th step), reactions were stopped 
and proteins degraded by addition of 5 |xl of a solution containing 
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FIGURE 1 I Intrinsic processivity of bacteriop)iageT4 wild type and 
mutant DNA polymerases. Primer elongation was determined in the 
presence of a heparin trap as described (see Materials and Methods). 
Reaction products were separated under denaturing gel electrophoresis 
conditions. (A) Almost no primer extension was detected if heparin was 
added before the DNA polymerase in the control reaction, lane 4. 
Significant primer extension was observed in reactions with 80 |iM dNTPs: 
wild type T4 DNA polymerase, lane 1; L412M-DNA polymerase, lane 2; 
I417V-DNA polymerase, lane 3. Less primer extension was observed when 
dNTPs were reduced to 1 \lM: wild typeT4 DNA polymerase, lane 5; 
L412M-DNA polymerase, lane 6; I417V-DNA polymerase, lane 7. Prominent 
termination sites are indicated. (B) Short film exposure of the reactions in 
panel A. The figure is modified from Reha-Krantz and Nonay (1 994) and is 
shown with permission from the ASBMB. 



3 parts Stop Solution (95% formamide, 20 mM EDTA, 0.05% 
bromophenol blue, 0.05% xylene cyanol blue) and 1 part Pro- 
teinase K Solution (100 (ig/ml proteinase K in 50 mM Tris-HCl, 
pH 8.0). The reactions were mixed and incubated at 40°C for 
1 5 min. Samples were heated at 80° C immediately prior to loading 
samples for gel electrophoresis. 



Note, for reactions with inorganic pyrophosphatase (iPPase), 
the T4 DNA polymerase solution Eilso contained 8 U/ml yeast 
iPPase. Alternatively, pyrophosphorolysis was inhibited in reac- 
tions with 16.7 mM Mg^+, 15 mM Na citrate and either 2.5 mM 
Mn^+ or 10 mM C3?+. 

RESULTS 

BIOCHEIVIICAL CHARACTERIZATION OF THE BACTERIOPHAGE T4 
L412M-DNA POLYMERASE 
Increased intrinsic processivity 

DNA polymerase intrinsic processivity refers to the number of 
nucleotides incorporated during one enzyme-DNA association 
repUcation cycle, but intrinsic processivity is partially masked 
in vivo by the association of processivity factors. For many DNA 
polymerases that function at replication forks, a clamp protein 
(gp45 in phage T4, PCNA in eukaryotes and the beta sliding clamp 
in Escherichia coU) tethers the DNA polymerase to the DNA tem- 
plate. The gp45 and PCNA clamps are composed of three identical 
subunits that form a doughnut structure around duplex DNA. 
The clamp is proposed to function as a tool belt to tether one 
or more DNA polymerases or other proteins found at replication 
forks (Pages and Fuchs, 2002; Indiani et al., 2005; Moldovan et al., 
2007). Although researchers have known for years that tethering 
of the T4 DNA polymerase in holoenzyme complexes increases 
both polymerase and exonuclease activities, Yang etal. (2004) 
observed that the primer-template appears to transfer between 
two tethered polymerases during proofreading. Thus, apparent 
continuous replication by the tethered T4 DNA polymerase may 
involve coordinated replication by two DNA polymerases tethered 
to the same clamp (loyce, 2004). Tethering, however, does not fully 
compensate for differences in intrinsic processivity as explained 
below. 

Without tethering, the wild type T4 DNA polymerase has lit- 
tle ability to initiate or extend a primer (Figure 1; Reha-Krantz 
and Nonay, 1993, 1994; Spacciapoli and Nossal, 1994). In reac- 
tions with ^^P-labeled primer, single-stranded circular DNA and 
heparin to trap dissociated DNA polymerase, the vast majority of 
primers are not extended by the wild type T4 DNA polymerase 
and the few extensions detected are limited to incorporation of 
one or two nucleotides (Figure IB, lane 1). Longer extension 
products are visible with prolonged film exposure (Figure lA, 
lane 1), which means that most non-tethered T4 DNA poly- 
merase molecules cannot extend the primer, the few complexes 
that are competent for extension incorporate just one or two 
nucleotides before dissociation, and only a few of these complexes 
escape the initiation phase and commence primer elongation. 
However, for primer extension complexes that escape early dis- 
sociation, there are preferential termination sites as indicated by 
discrete bands after incorporation of 46, 112, and 252 nucleotides 
(Figure lA). Thus, the intrinsic processivity of the wild type 
T4 DNA changes from an initial very low or essentially non- 
processive state to a state with higher processivity during the 
elongation phase, but the elongating DNA polymerase remains 
sensitive to difficult DNA sequences as demonstrated by pref- 
erential termination sites. The role of the clamp then is in 
initiation and in assisting replication through difficult DNA 
sequences. 
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Amino acid substitutions in Motif A of the T4 DNA poly- 
merase (Table 1) dramatically alter intrinsic processivity. Longer 
extension products are visible for the L412M-DNA polymerase 
(Figure lA, lane 2), especially when concentrations of deoxyri- 
bonucleoside triphosphates (dNTPs) are reduced to 1 (iM 
(Figure 1, lane 6); thus, more L412M-DNA polymerase com- 
plexes are able to escape the initiation phase and to replicate 
past difficult sequences. Kinetic studies are consistent with 
these observations. In standing start primer extension assays, 
the for dAMP incorporation is lower for the L412M-DNA 
polymerase compared to wild type, 11 versus 16 (iM (Hari- 
haran etal., 2006), and the k^g rate is about twofold slower 
(Fidalgo da Silva etal, 2002). While the L412M substitution 
is the most conservative substitution, dramatic changes in the 
opposite direction are observed for the slightly less conser- 
vative L412I substitution. The L412I-DNA polymerase can- 
not replicate DNA when the dGTP pool is reduced and an 
antimutator phenotype is observed instead of the mutator phe- 
notype observed for the L412M-DNA polymerase (Table 1). 
These features resemble the consequences of another conserva- 
tive substitution in Motif A, 14 17V. The I417V-DNA polymerase 
is much less processive than the wild type T4 DNA poly- 
merase (Figure 1, lane 3), especially when dNTP pools are 
reduced (Figure 1, lane 7). Like the L412I-DNA polymerase, 
the I417V-DNA polymerase cannot replicate DNA when the 
dGTP pool is reduced (Reha-Krantz and Nonay, 1994; Stocki 
etal., 1995) and proofreading is increased as indicated by the 
antimutator phenotype (Table 2). The low processivity of the 
I417V-DNA polymerase means that replication is inhibited by 
subtle DNA damage, for example a phosphotriester, which 
does not impede replication by other DNA polymerases (Tsu- 
jikawa etal., 2003). Because the differences in replication fidelity 
and sensitivity to dNTP pools are observed in vivo for the 
mutant DNA polymerases in the presence of the gp45 clamp, 
intrinsic processivity prevails in the presence of tethering and 
determines the stability of DNA polymerase complexes when 
dNTP pools are low, when DNA templates are damaged or 
difficult sequences are present, and if proofreading will be 
initiated. 

Sequences at preferential termination sites are informative 
because they identify DNA sequences that are difficult for the T4 
DNA polymerase to replicate. Many termination sites occur at sim- 
ple repeats: the -|-46 product terminates within the template ATAT 
sequence and the -1-112 product terminates at the beginning of 
the template GCGC sequence (Figure 1; Reha-Krantz and Nonay, 
1993). Termination with another DNA template was observed at 
the beginning of a template CACACA sequence (Spacciapoli and 
Nossal, 1994). These dinucleotide repeat termination sites are dis- 
tinct from another type of difficult sequence, namely hairpin DNA 
structures that form between inverted repeats in ssDNA ahead of 
an advancing DNA polymerase. The T4 DNA polymerase has lit- 
tle ability to bypass hairpin structures in the absence of T4 ssDNA 
binding protein (gp32) and the clamp (gp45) and clamp loading 
proteins (Huang et al, 1981; Roth et al, 1982; Bedinger et al, 1989; 
Jarvis etal, 1991; Hacker and Alberts, 1994). 

The simple repeat sequences at the termination sites in Figure 1 
likely cause pausing, which causes enzyme dissociation that is 



associated with initiation of the proofreading pathway (see review 
by Reha-Krantz, 2010). While mismatches at the primer-end trig- 
ger proofreading as an error avoidance mechanism (Brutlag and 
Kornberg, 1972; Muzyczka etal, 1972), gratuitous proofreading 
of correctly paired primer ends can be caused by anything that 
hinders continued primer elongation. In contrast to dissociation 
of the T4 DNA polymerase when proofreading is initiated, return 
of the trimmed primer-end to the polymerase active site is rapid 
and processive, even in the absence of tethering (Reddy et al., 1992; 
Fidalgo da Silva and Reha-Krantz, 2007). With respect to in vitro 
applications, less gratuitous proofreading and subsequent disso- 
ciation is observed for the T4 L412M-DNA polymerase than for 
other DNA polymerases; hence, difficult template sequences, low 
dNTP concentrations, and other factors that impede nucleotide 
incorporation will be less problematic for the more processive 
DNA polymerase. 

Biochemical mechanism for increased intrinsic processivity by the 
T4 L412M-DNA polymerase; insights provided by heightened 
sensitivity to PAA 

The L412M substitution converts the T4 DNA polymerase from 
resistance to sensitivity to the herpes virus antiviral drug, phos- 
phonoacetic acid (PAA). PAA-sensitivity is observed for the T4 
L412M-DNA polymerase in vivo (Table 1; Reha-Krantz et al, 1993; 
Li etal, 2010) and in vitro (Reha-Krantz and Nonay, 1994). PAA 
resembles pyrophosphate (PPi), a byproduct of nucleotide incor- 
poration; hence, PAA-sensitivity indicates increased PPi binding. 
PAA-sensitivity can be understood in terms of the nucleotide 
incorporation pathway described in Figure 2 (Reha-Krantz et al., 
2011). Complexlisapre-translocation (Pre-T) complex. Complex 
II is formed by translocation forward by one template posi- 
tion, which removes the PPi binding site present in Complex 
I and creates a new nucleotide binding pocket (pre-insertion 
site). Complexes I and II are in rapid equilibrium (Fidalgo da 
Silva etal., 2002; Hariharan etal., 2006). Nucleotide binding 
produces the open ternary complex. Complex III, and a confor- 
mational change produces the closed ternary complex. Complex 
IV. Nucleotide incorporation takes place in Complex V; then PPi 
is released to form Complex VI, a Pre-T complex like Com- 
plex I except that the primer-end has been extended by one 
nucleotide. 

Complex VI is at the crossroads of three competing pathways. 
The DNA polymerase may continue nucleotide incorporation by 
translocating to form Complex VII, a new post-T complex, or the 
reverse reaction can be initiated by re-binding PPi to form Com- 
plex V and then catalyzing pyrophosphorolysis, which produces 
dNTP and shortens the primer strand by one nucleotide (Complex 
IV). This is the pathway that is sensitive to PPi-like antiviral drugs. 
A third possibility is proofreading, which is initiated if the primer- 
end is mismatched, but gratuitous proofreading is also possible if 
polymerization conditions are not optimal, for example low dNTP 
pools. For proofreading, the end of the primer strand is separated 
from the template and transferred to the exonuclease site. A beta 
hairpin structure located in the exonuclease domain of the T4 
DNA polymerase acts as a wedge to stabilize the exonuclease com- 
plex. Complex VIII (Stocki etal., 1995; Marquez and Reha-Krantz, 
1996; Reha-Krantz, 1998, 2010; Reha-Krantz etal, 1998; Hogg 
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FIGURE 2 I Nucleotide incorporation scheme for the bacteriophage T4 
DNA polymerase. Pre- and Post-T complexes are in equilibrium (Complexes I 
and II). dNTP binding results in formation of an open ternary complex 
(Complex III); a conformation change to evaluate the accuracy of the incoming 
nucleotide results in formation of the closed ternary complex (Complex IV). 
Chemistry takes place to join the incoming nucleotide to the primer end and 
the PPi byproduct is formed (Complex V). PPi dissociates to form Complex VI, 



a new Pre-T, which is at the crossroads of three pathways. Complex VI is in 
equilibrium with the Post-T complex (Complex VII); binding the correct 
nucleotide starts another cycle of nucleotide incorporation. Alternatively, PPi 
may bind to Complex VI to form Complex V, which leads to the reverse 
reaction (pyrophosphorolysis). PAA, as a PPi mimic, also binds to Complex VI. 
A third alternative is proofreading (Complex VIII). The figure is modified from 
Reha-Krantz etal. (2011) and is shown with permission from the ACS. 
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etal, 2004, 2007; Subuddhi etal, 2008). All three pathways are 
in competition, but differences in DNA polymerase interactions 
with the primer-template affect which pathway is chosen. PAA- 
sensitivity indicates that the T4 L412M-DNA polymerase favors 
forming the Pre-T complex, which has a PPi-binding site, over 
the Post-T complex, which has a dNTP binding site. Although 
PPi-like antiviral drugs could conceivably inhibit replication by 
interfering with dNTP binding in Post-T complexes, structural 
studies show that PPi-like phosphonoformic acid (PFA) traps a 
mutant RB69 DNA polymerase in a complex that resembles Com- 
plex V (Zahn etal., 2011) as expected if PFA binds to the Pre-T 
complex. 

Formation of Pre- and Post-T complexes can also be observed in 
assays that use the fluorescence of the base analog 2AP. Pre-T com- 
plexes have higher fluorescence intensity than Post-T complexes 
for DNA substrates with 2AP positioned in the -|-1 position in the 
template strand, adjacent to the terminal base pair (Mandal etal., 
2002; Hariharan and Reha-Krantz, 2005; Reha-Krantz et al., 2011). 
The higher fluorescence intensity of polymerase complexes formed 
with the L412M-DNA polymerase indicates that the L412M-DNA 
polymerase favors formation of the highly fluorescent Pre-T com- 
plexes over Post-T complexes in the absence of PPi and dNTPs 
(Hariharan and Reha-Krantz, 2005). The role of the Pre-T complex 
in drug sensitivity is further substantiated by studies that show that 
PFA inhibits the HIV-1 reverse transcriptase by trapping a Pre-T 
complex (Marchand et al., 2007). We extend these findings by link- 
ing PAA-sensitivity of the L412M-DNA polymerase to increased 
intrinsic processivity, which suggests that Pre-T complexes are 
more stable, and less subject to enzyme dissociation than Post-T 
complexes (see further discussion by Li etal, 2010). 

Increased pyrophosphorolysis is observed for the T4 L412M-DNA 
polymerase 

Increased pyrophosphorolysis is expected for PAA-sensitive DNA 
polymerases because the equilibrium between Pre- and Post-T 
complexes favors Pre-T complexes, which can bind PPi (Figure 2). 
The PAA-sensitive T4 L412M-DNA polymerase catalyzes a robust 
pyrophosphorolysis reaction. Section "Materials and Methods" for 
assay conditions. The pyrophosophorolysis rate in the presence of 
2.5 mM PPifor the exonuclease-deficient (D112A/E1 14A) L412M- 
DNA polymerase is fivefold higher than the rate detected for 
the exonuclease-deficient T4 DNA polymerase with the wild type 
Motif A sequence and more than 80-fold higher than the less pro- 
cessive, exonuclease-deficient I417V-DNA polymerase. The K^^ 
for PPi for the exonuclease-deficient L4 1 2M-DNA polymerase was 
~0.5 mM compared to 2 mM for the exonuclease-deficient DNA 
polymerase with the wild type Motif A and was too high to be 
measured for the exonuclease-deficient 14 17V- DNA polymerase 
(Damaraju and Reha-Krantz, unpublished observations). 

Mechanisms to curb pyrophosphorolysis catalyzed by the 
L412M-DNA polymerase; formation of divalent cation-PPi 
complexes 

Pyrophosphorolysis is the reversal of the polymerization reaction, 
but pyrophosphorolysis is usually not a problem during DNA 
replication in vivo because iPPase degrade PPi to prevent build-up 



of the high concentrations of PPi needed to drive pyrophos- 
phorolysis. But high concentrations of PPi formed during DNA 
sequencing reactions support pyrophosphorolysis, which results 
in the degradation of chain-terminated sequencing products and 
ambiguous sequencing data (Figure 3). The addition of iPPase to 
sequencing reactions is often used to prevent pyrophosphorolysis 
(Tabor and Richardson, 1990), and this is successful in sequencing 
reactions catalyzed by the L412M-DNA polymerase, but divalent 
metal ions - Mn^+ and Ca^+, are equally effective when com- 
bined with Mg~+ in the presence of 15 mM Na citrate (Figure 3; 
Damaraju and Reha-Krantz, unpublished observations). 

The mechanism of inhibition of pyrophosphorolysis by diva- 
lent metal cations is complex because, as demonstrated for yeast 
iPPase (Ridlington and Butler, 1972), while some divalent metal 
ions such as Mg^+ are needed for catalysis (activators), other 
metal ions such as Mn^+ or Ca^+ may inhibit activity by binding 
to the enzyme or DNA or by forming complexes with PPi. We 
observed this complexity for the nucleotide incorporation and 
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FIGURE 3 1 Pyrophosphorolysis detected 


in DNA sequencing reactions 


with FIDELITY™. 


(A) Three sets of DNA sequencing reactions were 


performed (see Materials and Methods): without a pyrophosphorolysis 


inhibitor (minus iPPase), with added yeast inorganic pyrophosphatase (plus 


iPPase), 


and with manganese. Results similar to the manganese 


supplemented reactions were detected with calcium. No or faint bands 


were detected for several products terminating inT in the minus iPPase 


reactions 


. Reactions terminating 


in T for all sets of reactions are shown to 


the right. 


missing or faint reaction products terminating in T are indicated by 


a capita 


etter to the right of the 


lanes. (B)The DNA sequence for the 


reactions shown in panel A. BoldTs indicate missing or faint terminating Ts; 


the letter code identifies pyrophosphorolysis sensitive sites. The sequences 


are analyzed inTable 3. Pyrophosphorolysis-resistantTs are indicated by a 


number below the sequence. 
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12 3 4 

Mn2+ concentration (mM) 

FIGURE 4 I Inhibiting pyropliospliorolysis with manganese. Increased 
manganese is needed to inhibit the pyrophosphorolysis reaction catalyzed 
by the exonuclease-deficient L412M-DNA polymerase when the 
concentration of PPi is increased. 



pyrophosphorolysis reactions catalyzed by the T4 exonuclease- 
deficient L412M-DNA polymerase (Damaraju and Reha-Krantz, 
unpublished data). Either Mg^+ or Mn^+ can support nucleotide 
incorporation and pyrophosphorolysis reactions, but Mg^+ sup- 
ports both reactions optimally at concentrations from 1 to > 5 mM, 
while Mn^+ is only half as effective as Mg^+ and over a narrow 
concentration range that peaks at about 0.5 mM. In the pres- 
ence of 15 mM Na citrate, however, Mn^+ supports nucleotide 
incorporation over a broad concentration range up to 5 mM, 
but the pyrophosphorolysis reaction is supported only over the 
narrow range from 0.5 to 1 mM. Thus, Na citrate protects 
the nucleotide incorporation reaction at high concentrations of 
Mn^+, but not the pyrophosphorolysis reaction. This means that 
pyrophosphorolysis catalyzed DNA degradation can be avoided in 
DNA sequencing reactions by the addition of 15 mM Na citrate 
and 3 to 5 mM Mn^+, but note further improvements discussed 
below. 

Na citrate protects the nucleotide incorporation reaction by 
functioning as a weak chelator, which reversibly binds divalent 
metal cations; however, PPi also forms complexes with metal 
cations. This point is demonstrated by the observation that higher 
amounts of Mn^+ ions are needed to inhibit pyrophosphoroly- 
sis by the T4 L412M-DNA polymerase as the concentration of 
PPi is increased (Damaraju and Reha-Krantz). For example, in 
the presence of 1 mM PPi, ~ 1 mM Mn^+ produces 50% inhi- 
bition of pyrophosphorolysis; for 2 mM PPi, ~2 mM Mn^+ is 
needed for 50% inhibition; and for 4 mM PPi, >3 mM Mn^+ 
is required to achieve the same level of inhibition (Figure 4). 
The stoichiometry of the reaction is consistent with formation 
of Mn-PPi chelate complexes or in the case of Ca^+, Ca-PPi 
chelate complexes. Thus, there is a complex equilibria involving 
one or more metal ions (Me): DNA pol-Me, citrate-Me, PPi- Me, 
free Me^+, free DNA pol, free PPi, free Na citrate, and active 
and inactive DNA pol- Me-PPi complexes. While Mn-PPi bind- 
ing may inactivate pyrophosphorolysis directly, this is not the 
most likely explanation for inhibition in reactions with Na cit- 
rate and Mn^+ above 3 mM because nucleotide incorporation 
is not inhibited; Mn-PPi binding would be expected to inhibit 
both reactions. Another explanation is that Mn^+ sequesters 
PPi and reduces the high concentration of PPi needed for the 
pyrophosphorolysis reaction, but there is sufficient free Mn^+ 
(~0.5 mM) to support, but not to inhibit, the nucleotide incor- 
poration reaction. The ability of Na citrate to maintain low 
concentrations of Mn^+ has been proposed by others (Beck- 
man etal., 1985; Tabor and Richardson, 1989, 1990), but in 
this case Mn^+ was thought to improve the evenness of band 
intensities (DNA sequencing products) by reducing discrimi- 
nation against chain-terminating nucleotides. Ca^+ can also 
chelate PPi and effectively prevent pyrophosphorolysis (results not 
shown). 

Although conditions were optimized for Mn^+-dependent 
nucleotide incorporation and minimal pyrophosphorolysis, 
nucleotide incorporation was only 60% of what is observed 
in Mg^+ -dependent reactions and Ca^+ does not support 
the nucleotide incorporation reaction catalyzed by T4 DNA 
polymerase. Optimal nucleotide incorporation and minimal 
pyrophosphorolysis activity were obtained in mixed, divalent 



metal reactions with Mg^+ and either Mn^+ or Ca^+ in the pres- 
ence of Na citrate. For DNA sequencing using FIDELITY^'^ (see 
Materials and Methods), pyrophosphorolysis was essentially elim- 
inated in reactions with 16.7 mM Mg^+, 15 mM Na citrate and 
either 2.5 mM Mn^+, or 10 mM Ca^+ without adverse effects on 
DNA sequencing efficiency (Figure 3). 

DNA sequence context effects on ttie pyroptiosptiorolysis reaction 
catalyzed by the T4 L412M-DNA polymerase 

DNA sequence effects have been reported for the pyrophospho- 
rolysis reactions catalyzed by the Klenow fragment of E. coli 
DNA polymerase I (Mizrahi etal, 1986) and the phage T7 DNA 
polymerase (Tabor and Richardson, 1990), but systematic stud- 
ies were not carried out to determine if specific DNA sequences 
are required for pyrophosphorolysis. We observed both general 
and specific DNA sequence contexts for the pyrophosphorolysis 
reactions catalyzed by the exonuclease-deficient T4 wild type and 
L412M-DNA polymerases. 

Conditions that favor formation of Pre-T complexes (Com- 
plex VI, Figure 2) are predicted to increase pyrophosphorolysis 
because PPi can bind to this complex. In previous studies using 
the fluorescence of the base analog, 2AP, more Pre-T than Post- 
T complexes are formed with A-|-T-rich than G-|-C-rich DNA 
substrates (Hariharan and Reha-Krantz, 2005; Hariharan etal., 
2006). Thus, pyrophosphorolysis is predicted to favor DNA 
substrates with A-|-T-rich primer-terminal regions. This was 
observed in reactions with A-l-T- and G-|-C-rich DNAs labeled 
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with 2AP at the 3'-primer end (see Materials and Methods). 
The rate for release of the terminal d2APTP by the exonuclease- 
deficient L412M-DNA polymerase in the presence of 2.5 mM 
PPi was sixfold faster for the A+T-rich DNA compared to the 
G+C-rich DNA, 14 s^^ compared to 2.4 s^^ (Damaraju and 
Reha-Krantz). The exonuclease-deficient T4 DNA polymerase 
with the wild type Motif A sequence also favored A+T-rich 
DNA, but with slower rates, 3 and 0.5 s^' for A-l-T- and 
G-|-C-rich DNAs, respectively. However, while pyrophospho- 
rolysis was observed for DNA substrates with a terminal 2AP 
opposite template T, pyrophosphorolysis was not detected for 
2AP opposite template C as expected if pyrophosphorolysis 
requires a matched primer-end. DNA substrates with A-|-T-rich 
primer-terminal regions also favor the exonucleolytic proofread- 
ing pathway (Bessman and Reha-Krantz, 1977; Bloom etal., 
1994; Reha-Krantz, 2010). Thus, increased formation of Pre- 
T complexes, which likely reflects decreased ability to form 
Post-T complexes, increases the likelihood of proofreading or 
pyrophosphorolysis. Proofreading is favored if the primer-end is 
mismatched and pyrophosphorolysis is observed if sufficient PPi 
is present. 

Specific DNA sequences were observed to promote pyrophos- 
phorolysis under DNA sequencing conditions using the 
fidelity'"''^ manual DNA sequencing kit developed by ONCOR, 
Gaithersburg, MD, USA. Reactions contained the exonuclease 
deficient L412M-DNA polymerase, the gp45 processivity fac- 
tor, the gp45 loading proteins as well as the T4 ssDNA binding 
protein, gp32, to prevent formation of DNA hairpin structures 
in ssDNA. Reaction products were internally labeled with a^^P- 
dATP; the template was single-stranded circular DNA. Instead of 
2',3'-dideoxy nucleoside triphosphate terminators (ddNTPs), 3'- 
amino-2',3'-dideoxy nucleoside triphosphates (3'NH2ddNTPs) 
were used because the T4 DNA polymerase showed less dis- 
crimination. 3'NH2ddNTPs can be purchased from TriLink 
BioTechnologies, San Diego, CA, USA. All reaction compo- 
nents and procedures are described in Section "Materials and 
Methods." Four separate reactions were run, each with a different 
chain-terminator. The reaction products were separated by 
polyacrylamide gel electrophoresis under denaturing conditions 
(Figure 3A). In the absence of agents that reduce pyrophospho- 
rolysis - iPPase or manganese (Mn^ +), there are missing bands due 
to pyrophosphorolysis as well as many faint bands (Figures 3A,B) . 

It is readily apparent that not all chain-terminated primer ter- 
mini are equally sensitive to pyrophosphorolysis; only primer 
ends ending in "T" were subject to pyrophosphorolysis, but 
not all terminal Ts (see right side of Figure 3A). Examina- 
tion of 200 nucleotides of DNA sequence (Figure 3B) with 54 
sequencing products that terminated in T revealed that only 12 
of the terminal Ts were subject to pyrophosphorolysis. What 
is different about the DNA sequences for pyrophosphorolysis- 
sensitive and insensitive sites? There are three related sequences 
in the primer-template region that promote pyrophosphorol- 
ysis (Table 3); the sequences differ only in the base pairs at 
the —1 and —2 positions. Sequence 1 has a GtemplateCprimer 
base pair in the —2 position; sequence 2 has a QemplateGprimer 
base pair in the —1 position, but not in the —2 position; and 
sequence 1/2 has a GtemplateCprimer base pair in the —2 and 



a QemplateGprimer base pair in the —1 position. A template 
"G" was never observed in the -|-1 position in the template 
strand, even though statistically 2 or 3 were expected in the 1 1 
sequences examined. Note that a "G" in the -|-1 position for 
a terminal T at position 26 in the DNA sequence (Figure 3B) 
stabilized the T to pyophosphorlysis even though the remain- 
der of the consensus sequence is present. Similarly, a template 
"A" was not observed in the +4 position, but an A stabilized 
the Ts at DNA sequence positions 5 and 8. The consensus 
sequences 1, 2 and 1/2 (Table 3) are required to sensitize termi- 
nal Ts to pyrophosphorolysis, but one exception was observed, 
see Ea in Figure 3. Although the consensus sequence is not 
present, the terminal T is just before a dinucleotide GT repeat, 
which may stall formation of Post T complexes and, thus, 
promote pyrophosphorolysis. Thus, the pyrophosphorolysis sen- 
sitive sites observed in Figure 3 may be an underestimation of 
the number of sensitive sites. Nevertheless, the strong bias for 
pyrophosphorolysis for only primers ending in "T" and only for 
a subset of terminal "Ts" indicates sequence specificity for the 
pyrophosphorolysis reaction catalyzed by the T4 L412M-DNA 
polymerase. 

Increased incorporation of fluoropliore-labeled nucleotides 

The L412M-DNA polymerase has increased ability to incorpo- 
rate modified nucleotides compared to the wild type T4 DNA 
polymerase, exonuclease-deficient T4 DNA polymerases and 
other mutant T4 DNA polymerases and DNA polymerases from 
other organisms (Goodman and Reha-Krantz, 1999). Reactions 
showing incorporation and extension of a variety of modified 
nucleotides with the L412M- and exonuclease-deficient L412M- 
DNA polymerase are shown in US patent 5945312. The modified 
nucleotides tested include, but are not limited to rhodamine- 
dUTP, fluorescein-dUTP, rhodamine-dCTP, biotin-dCTP and 
DIG-dCTR For all modified nucleotides, the L412M-DNA poly- 
merase performed significantly better than the wild type T4 DNA 
polymerase or DNA polymerases from other organisms. The T4 
L4 1 2M-DNA polymerase was used to prepare fluorescently labeled 
DNA with one or two fluorophore-labeled bases for proof-in- 
principle testing of DNA sequencing by exonuclease digestion 
proposed by Keller and colleagues (Goodwin etal, 1995; Werner 
etal, 2003). 

THE PHAGE T4 EXONUCLEASE-DEFICIENT L412M-DNA POLYMERASE AS 
A DNA SEQUENCING ENZYME: FIDELITY™ 

The FIDELITY^M DNA sequencing kit (ONCOR, Gaithersburg, 
MD, USA) was developed for routine DNA sequencing, but espe- 
cially for sequencing difficult DNA sequences. The use of the 
exonuclease-deficient L412M-DNA polymerase with increased 
intrinsic processivity plus the addition of T4 processivity fac- 
tors and ssDNA binding protein provided the means to sequence 
all types of difficult DNA sequences with greater success than 
achieved by other methods as judged by testimonials and appli- 
cations, for example, see Devireddy and Jones (1998) and TuUius 
etal. (2001). An example of routine DNA sequencing is shown in 
Figure 3 and examples of sequencing difficult DNAs are shown 
in Figure 5. Note the clean sequence for the G-|-C-rich template 
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Table 3 | DNA sequences that promote pyrophosphorolysis by the phage T4 L412M-DNA polymerase. 
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^ n, terminal base pair. 

is any nucieotide: A, C, G, T. 
Chain terminator 

^ Letter code that corresponds to faint or missing Ts in the sequencing reactions shown in Figure 3. 
Bold indicates highly conserved sequence. 
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FIGURE 5 I Sequencing difficult DIMAs with FIDELITY™. The 

combination of the exonuclease-deficient L412M-DNA polymerase, theT4 
clamp and clamp loading proteins, and theT4 single-stranded DNA binding 
protein provides the means to sequence difficult DMAs with high G+C 
content (A) and tracks of mononucleotide repeats (B). 



(Figure 5A) and for a template with several tracks of C repeats 
(Figure 5B). 

USING GENETIC APPROACHES TO IDENTIFY "BIOTECH " DNA 
POLYMERASES: IN VIVO AND IN VITRO SELECTION METHODS 

Genetic selection methods are powerful tools to identify mutant 
DNA polymerases on the basis of phenotype without the need 
to have a complete understanding of structure or function. The 
advantage of genetic selection strategies is that if the stringency 
is sufficiently strong, then only mutants with the desired pheno- 
type will survive. This expedites "finding a needle in a haystack." 
The phage T4 L412M-DNA polymerase was the result of two 
consecutive genetic selection schemes. 

First, a genetic selection was carried out for the identification 
of mutant T4 DNA polymerases with a strong mutator pheno- 
type. Details for the selection of mutator T4 DNA polymerases 
(Reha-Krantz etal, 1986; Reha-Krantz, 1988) and a similar strat- 
egy for mutator yeast DNA polymerase 8 mutants (Murphy etal, 
2006) are described. Several mutator T4 DNA polymerases were 
identified, but not the L412M-DNA polymerase because this 
mutant replicates DNA with relatively high fidelity as discussed 
(Table 2). For many isolates there was more than one muta- 
tion in the DNA polymerase gene. Single mutant strains had to 
be constructed in order to determine if one or more mutations 
were required to confer the mutator phenotype and to rule out 
contributing mutations in other genes. For one isolate, called 
mel5, there were 11 mutations in the DNA polymerase gene. 



but only the mutation encoding the D131G amino acid substi- 
tution was required for mutator activity (Reha-Krantz, 1988). 
The D131G-DNA polymerase has strongly reduced ability to 
form exonuclease complexes and, as a consequence, has severely 
reduced 3' — ^5' exonuclease activity (Baker and Reha-Krantz, 
1998). 

Why were there so many mutations in the mel5 strain? One 
possibility is that error-prone replication by mutator DNA poly- 
merases created many mutations within the DNA polymerase 
gene and in other genes during propagation, but a more likely 
explanation is that additional DNA polymerase mutations were 
selected to temper the mutational burden produced by the D131G 
amino acid substitution. This proposal is supported by the 
presence of a mutation in the mel5 strain that encodes the 
I417V substitution in Motif A (Table 1). The I417V-DNA poly- 
merase has an antimutator phenotype, increased proofreading, 
and low intrinsic processivity (Reha-Krantz and Nonay, 1994; 
Figure 1 and Table 2), activities which would help to maxi- 
mize residual exonuclease activity in the presence of the D131G 
substitution and increase replication fidelity. These activities, 
however, would also help to improve maturation of Okazaki 
fragments. Okazaki fragment maturation requires the coordi- 
nated action of DNA polymerase 3' — >5' exonuclease activity 
with a 5' — >3' exonuclease, ExoI/RNaseH in T4 and Rad27 
in yeast, to create a nick that can be sealed by DNA lig- 
ase (Jin etal, 2004; reviewed by Reha-Krantz, 2010). In the 
absence of DNA polymerase 3' — >5' exonuclease activity, the 
DNA polymerase catalyzes strong displacement synthesis at junc- 
tions between Okazaki fragments that creates 5' — >3' flaps that 
prevent ligation unless the flap is removed. Persisting unjoined 
Okazaki fragments are dangerous because strand discontinu- 
ities result in double strand breaks in the next replication cycle. 
Thus, there are at least two strong selective pressures exerted 
to decrease the negative effects of reduced DNA polymerase 
proofreading. 

The second selection to identify the L412M-DNA poly- 
merase was to select for suppressors of the low processivity 
and high proofreading activity of the 14 17V- DNA polymerase 
(Table 1 and Figure 1). One consequence of low proces- 
sivity is the need for high dNTP pools to sustain replica- 
tion. The I417V-DNA polymerase could not replicate DNA 
when the dGTP pool was reduced, which allowed for selec- 
tion of second site mutations that allowed replication under 
low dGTP conditions (Reha-Ki'antz and Nonay, 1994; Stocki 
etal, 1995; Li etal, 2010). Mutations encoding the L412M 
substitution in Motif A and elsewhere in the DNA polymerase 
gene were identified (Stocki etal, 1995; Li etal, 2010). Thus, 
the mutation encoding the L412M substitution and functionally 
equivalent amino acid substitutions are expected to be iden- 
tified as suppressors of mutant DNA polymerases with low 
processivity. 

Selection schemes can also be done in vitro to identify mutant 
DNA polymerases with desired properties for biotech applications. 
Bourn etal. (2011) describe an in vitro strategy beginning with a 
chimeric DNA polymerase derived from Pyrococcus kodakaraensis 
and Pyrococcus furiosus, which was named Kofu DNA polymerase. 
Several mutant DNA polymerases with increased processivity 
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and other desirable features were identified, but many had 
multiple mutations as we observed for in vivo selections with 
the T4 DNA polymerase. There are no reports to link amino 
acid substitutions in the Kofu DNA polymerase with specific 
polymerase properties and the amino acid substitutions in the 
DNA polymerase marketed as KAPA HiFi DNA polymerase by 
KAPA Biosystems Ltd., South Africa, are proprietary. Bourn 
etal. (2011) suggest, however, that advantageous properties may 
require the combined action of several amino acid substitutions. 
None of the amino acids disclosed, however, are in Motif A, 
but there could be amino acid substitutions in other DNA poly- 
merase regions that affect processivity, for example see Stocki 
etal. (1995), Reha-Krantz and Wong (1996), and Li etal. (2010). 
While our genetic selections were done in vivo, in vitro strate- 
gies similar to Bourn etal. (2011) can be envisioned that exploit 
sensitivities to dNTP pool concentrations, PAA or PPi can be 
envisioned. 

We have applied information learned about T4 DNA poly- 
merase function to other DNA polymerases, namely yeast DNA 
polymerases alpha and delta. Amino acid changes in the yeast 
DNA polymerases that are analogous to the L412M substitution 
in the T4 DNA polymerase produce PAA-sensitivity (Table 1; Li 
etal., 2005). Because PAA-sensitivity is correlated with increased 
intrinsic processivity, the mutant yeast DNA polymerases are also 
predicted to have increased processivity; however, the mutant 
yeast DNA polymerases are more error prone than the L412M- 
DNA polymerase (Table 2; Niimi et al, 2004; Li et al, 2005). The 
differences in replication fidelity for the T4 and yeast DNA poly- 
merases suggest differences in the equilibria between Pre- and 
Post-T complexes and in forming exonuclease complexes. How- 
ever, fine tuning this balance can be achieved by testing other 
amino acid substitutions in Motif A and by combining amino 
acid substitutions. For example, the phage T4 L412M/I417V-DNA 
polymerase is PAA-sensitive, although less than the L412M-DNA 
polymerase, but the double mutant has an antimutator phenotype 
and slightly increased proofreading compared to the wild type T4 
DNA polymerase (Tables 1 and 2). Thus, single or multiple amino 
acid changes within the Motif A sequence of DNA polymerases in 
general may produce mutant DNA polymerases with the optimal 
balance of intrinsic processivity and proofreading that will allow 
replication of difficult DNA sequences and increased utilization of 
modified nucleotides while maintaining replication fidelity. 

DISCUSSION 

The phage T4 exonuclease-deficient L412M-DNA polymerase is 
proven to be an excellent DNA sequencing enzyme especially when 
combined with processivity factors (clamp and clamp loaders) 
and ssDNA binding protein; this is the basis of FIDELITY™. 
AH DNAs tested, including DNAs with long stretches of mono- 
and di-nucleotide repeats and high A-l-T- or G-|-C-content are 
sequenced cleanly (Figure 5; S. Woodgate personal observations). 
While the T4 L412M-DNA polymerase remains an attractive can- 
didate as a DNA sequencing enzyme, other DNA polymerases may 
have additional desirable properties that would profit by modifi- 
cations to the Motif A sequence. Indeed, in addition to our studies 
of Motif A in phage T4 and yeast DNA polymerases (Reha-Krantz 
and Nonay, 1 994; Li et al., 2005; Table 1 ) , Motif A in family B and A 



DNA polymerases is surprisingly amenable to engineering despite 
being a conserved sequence (Table 1; Patel and Loeb, 2000; Niimi 
et al, 2004; Venkatesan et al, 2006, 2007). Our studies demonstrate 
that Motif A functions in determining intrinsic processivity for the 
phage T4 DNA polymerase (Figure 1) and, by extrapolation, to 
other DNA polymerases. The level of intrinsic processivity deter- 
mines the ability of DNA polymerases to replicate difficult DNA 
sequences and to incorporate modified nucleotides; these prop- 
erties are maintained even when dNTP pools are reduced for the 
L412M-DNA polymerase (Figure 1). Different amino acid sub- 
stitutions and combinations of substitutions in Motif A allow 
fine-tuning to achieve the optimal balance of intrinsic processivity 
for the replication of difficult DNA templates while maintain- 
ing high replication fidelity; these are highly desirable properties 
for sequencing, amplification and labeling technologies. We focus 
here on Motif A because this motif is conserved in all DNA and 
RNA polymerases, which means that site-directed mutagenesis of 
Motif A can be used to identify mutant DNA polymerases with 
increased intrinsic processivity that still retain adequate proof- 
reading activity. However, other DNA polymerase regions also 
affect intrinsic processivity (Reha-Krantz etal., 1993; Stocki etal., 
1995; Li etal., 2010) and, thus, are attractive secondary mutational 
targets. 

Increased intrinsic processivity increases pyrophosphorolysis, 
but methods were shown to curb pyrophosphorolysis activity of 
the exonuclease-deficient L412M-DNA polymerase by the addi- 
tion of iPPase or divalent metal ions, Mn2+ or Ca'+ (Figure 3). 
In the absence of these agents, pyrophosphorolysis under DNA 
sequencing conditions displayed sequence specificity. Pyrophos- 
phorolysis was detected only for a subset of sequencing products 
terminated with T (Figure 3). Although pyrophosphorolysis by 
the T4 wild type and L412M-DNA polymerases was higher for 
DNA substrates with A-l-T- compared to G-|-C-rich primer ter- 
minal regions, the sequence specificity observed under DNA 
sequencing conditions was specific for primer-ends with the T- 
terminating nucleotide plus either a CtempiateGprimer base pair in 
the — 1 position or a GtemplateCprimer base pair in the —2 position 

or a CtempiateGprimer in the —1 and a GtemplateCprimer in the —2 

position (Table 3). Dinucleotide repeats may also sensitize termi- 
nal Ts, see the pyrophosphorolysis sensitive site Ea in Figure 3 
and Table 3. The preference of pyrophosphorolysis for A-|-T-rich 
primer-terminal regions is expected for the T4 DNA polymerase, 
because A-|-T-rich primer-termini favor formation of Pre-T com- 
plexes (Hariharan and Reha-Krantz, 2005), which can bind PPi 
and PPi-like antiviral drugs (Figure 2). The sequence-dependence 
for pyrophosphorolysis under DNA sequencing conditions must 
also indicate sequences that favor Pre-T complexes, but the 
mechanism is unclear. Unfortunately, while there are numerous 
structural studies of the RB69 DNA polymerase (for example 
Franklin et al, 2001; Hogg et al, 2004; Wang et al, 201 1), which is 
a close relative of the phage T4 DNA polymerase (Hogg etal., 
2006), the primer-template DNAs used in crystallography did 
not have the pyrophosphorolysis-sensitive sequences indicated in 
Table 3; thus, a direct test of DNA polymerase interactions with 
the pyrophosphorolysis sequences has not been done. However, 
the DNA substrate used to capture a mutant RB69 DNA poly- 
merase with the PPi analog, PEA (foscarnet), was A-|-T-rich except 
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for the terminal acyclo guanine (Zahn et al., 20 1 1 ), which is consis- 
tent with our data that demonstrate increased pyrophosphorolysis 
with A+T-rich DNAs. 

We propose that the sequence specificity for the pyrophos- 
phorolysis reaction under DNA sequencing conditions is caused 
by subtle interactions of the DNA polymerase with a dynamic 
primer-terminal region that breathes more or less depending 
on the DNA sequence, but we do not rule out the possibil- 
ity of base recognition, especially for bases in the -|- 1 and +4 
positions in the template strand (Table 3). Breathing in the ter- 
minal region is proposed to explain why A-|-T-richness favors 
proofreading by increasing the strand separation needed to form 
exonuclease complexes (for example, see Bloom etal., 1994 and 
Bessman and Reha-Krantz, 1977); we suggest here that breathing 
also affects the rapid equilibrium between Pre- and Post-T com- 
plexes and increased breathing favors Pre-T complexes. Breathing 
has been detected at the primer-terminal junction by fluores- 
cence studies that demonstrate measurable unwinding ~2 base 
pairs into the duplex region (lose etal., 2009), which may be 
exacerbated by DNA polymerase interactions. But how does the 
DNA polymerase distinguish between opting for pyrophosphorol- 
ysis compared to proofreading? Proofreading a chain-terminated 
primer-end may always be preferred, but the DNA polymerase in 
sequencing reactions is exonuclease-deficient which allows detec- 
tion of pyrophosphorolysis, but pyrophosphorolysis is not simply 
a default pathway because sequence specificity is observed. The 
pyrophosphorolysis-sensitive DNA sequences (Table 3) must sta- 
bilize formation of stable Pre-T complexes (reduce dissociation) at 
the expense of Post-T complexes. In other words, sequences that 
promote pyrophosphorolysis must hinder translocation to form 
Post-T complexes. 

A critical step in translocation may involve interactions of 
the DNA polymerase at the — 1 position in the primer-terminal 
region, using the naming system indicated in Table 1 with "n" 
designating the terminal base pair. Structural studies show that 
a conserved lysine residue intercalates into the primer-terminal 
region at the — 1 position of ternary complexes formed with the 
RB69 DNA polymerase (Franklin etal, 2001), but in complexes 
trapped with the PPi mimic PFA (Zahn et al., 2011), translocation 
was blocked after nucleotide incorporation and lysine intercala- 
tion did not change, which meant intercalation was observed at 
the —2 position. Thus, lysine intercalation advances step wise after 
each nucleotide is incorporated with intercalation at the —2 base 
pair position in Pre-T complexes before translocation and advanc- 
ing to the — 1 base pair position to form Post-T complexes. The — 1 
and —2 positions are also critical for promoting pyrophosphorol- 
ysis; a GtemplateCprimer base pair is observed in the —2 position of 
sequences that promote pyrophosphorolysis (sequence 1) unless 
there is a QemplateGprimer base pair in the —1 position (sequence 
2) or two GC and CG base pairs are observed in the —2 and 
— 1 positions, respectively (Table 3). If localized breathing in the 
primer- terminal region is the critical factor for forming stable Pre- 
T complexes, disfavoring exonuclease complexes, and hindering 
translocation to form Post-T complexes, then the requirement of 
GC and CG base pairs at the —2 and — 1 positions suggests that GC 
base pairs impart stability that reduces breathing and strand sep- 
aration needed to form exonuclease complexes. But what hinders 



formation of Post-T complexes? One possibility may reflect the 
effect of duplex stability on the ability of the conserved lysine 
residue to intercalate into duplex DNA if the ease of intercalation 
is affected by A-|-T/G-|-C-richness. Another possibility may be that 
sequences which promote pyrophosphorolysis cause a slight dis- 
tortion of the DNA helix. This possibility is suggested because 
not any GC base pair is sufficient, but only a GtemplateCprimer 
base pair in the —2 position or a CtemplateGprimer base pair in 
the — 1 position, not the reverse Watson-Crick base pairs. There is 

also specificity for A,emplateTprimer Or C template Gprimer base pairs 

in the —3 position. While base recognition has not been ruled 
out, subtle physical changes in the primer-terminal region appear 
to affect the equilibria between Pre- and Post-T complexes and 
exonuclease complexes. This suggestion is in keeping with try- 
ing to understand the underlying mechanism to explain how the 
conservative L412M substitution in the T4 DNA polymerase has 
such profound effects on processivity and sensitivity to the PPi- 
mimic PAA (Figure 1; Reha-Krantz and Nonay, 1994) while the 
L412I substitution also has profound effects, but in the opposite 
direction. Structural studies of the analogous RB69 L415M-DNA 
polymerase of Post-T ternary complexes with dNTP (Xie et al., 
2013) or Pre-T complexes trapped with PFA (Zahn etal., 2011) 
do not shed light on mechanism, which then leaves open the pos- 
sibility that rapidly changing conformations not yet captured in 
structural studies are behind the observed characteristics of the T4 
L412M- and L412I-DNA polymerases. 

Future studies are needed to determine if the pyrophosphoroly- 
sis-sensitive sequences described in Table 3 promote pyrophos- 
phorolysis in the absence of a chain-terminating T or if the 
chain-terminator contributes to the observed sequence specificity. 
DNA binding studies with the primer-templates described in 
Table 3 are also needed. Studies are also needed to explore the 
possibility that bases in the -|-1 and +4 positions in the template 
are recognized by the T4 DNA polymerase. The ability of "A" in the 
+4 position in the template strand to prevent pyrophosphorolysis 
suggests that the DNA polymerase can detect sequence. Interest- 
ingly, the Pfu DNA polymerase detects deaminated bases in the +4 
position as part of an apparent error avoidance mechanism (Con- 
nolly, 2009). The sequence specificity of the pyrophosphorolysis 
reaction suggests possible implications for antiviral drug therapy. 
For example, is the sensitivity of herpes viral DNA polymerases 
to PPi analogs, PAA/PFA/foscarnet, dependent on DNA sequence? 
See additional discussion by Li et al. (2010). 

In conclusion, amino acid substitutions in the phage T4 DNA 
polymerase Motif A increase or decrease intrinsic processivity by 
altering the equilibria between Pre- and Post-T complexes and for- 
mation of exonuclease complexes. Amino acid changes in Motif 
A affect intrinsic processivity by increasing the stability of Pre-T 
complexes. Increased stability of Pre-T complexes means increased 
ability to replicate difficult DNA sequences and to incorporate 
modified nucleotides. Motif A is an attractive target for engineer- 
ing "biotech" DNA polymerases because intrinsic processivity can 
be fine-tuned by using different amino acid substitutions and com- 
binations of amino acid substitutions to optimize replication of 
difficult DNA sequences and to enhance the ability to use mod- 
ified nucleotides while maintaining replication fidelity. Another 
important take-home message is to acknowledge that DNA and 
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DNA polymerase-DNA interactions are dynamic, especially in 
the primer-terminal region (Hariharan and Reha-Krantz, 2005; 
Hariharan etal., 2006; Jose etal., 2009). Lastly, DNA polymerases 
may have more ability to detect specific sequences than previously 
recognized. 
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